How to Program a Parallel Computer?
نویسنده
چکیده
The aim of this survey is to show that efficient programmerfriendly parallel computation is possible, under certain general assumptions. However, a lot of work remains to be done before parallel computation is everyday practice. The main reason limiting the efficiency of the sequential computer is the so called von Neumann bottleneck. As all data is processed in a single processor, it becomes a bottleneck of execution. The speed of processors has grown tremendously — now we have processors executing more than 10 instructions per second. Also the bandwidth (i.e. the number of bits per second that can be moved between two points) of data communication has grown enormously — we can speak about transfer rates of terabits per second. However, increased processor speed and bandwidth are not enough. As the speed of light is constant, a datum farther than 15 cm cannot be fetched within a clock cycle. Also the speed of memories has not grown as fast as the speed of processors. Thus getting a datum from memory takes the time of tens of instructions. This is the latency problem. The von Neumann bottleneck is removed by using more than one processor. Ideally, by p processors we should be able to do p times more than one processor, assuming that processors can work independently. However, if we should speedup a single computation, getting p-fold efficiency is challenging. By the theory of parallel algorithms we know that many common problems have algorithms that consist of a large number of parallel threads, and thus they can utilize a large number of processors. Highly parallel algorithms also solve the latency of memory access, using the slackness principle. In this talk, we shall first show, how a lot of parallel threads can be found in problems that seem sequential at first sight. Then we shall show, how slackness principle is applied to hide the latency. Finally, we shall consider computer architectures that have enough bandwidth to allow efficient execution of parallel algorithms.
منابع مشابه
Evaluation of scheduling solutions in parallel processing using DEA FDH model
This paper gives a new application of DEA to evaluate the scheduling solutions of parallel processing. It evaluates the scheduling solutions of parallel processing using the non-convex DEA model, FDH model. By introducing each solution of parallel processing scheduling as a DMU with some relevant inputs and outputs this paper shows that how the most efficient schedule(s) can be identified.
متن کاملSpeeding up the Stress Analysis of Hollow Circular FGM Cylinders by Parallel Finite Element Method
In this article, a parallel computer program is implemented, based on Finite Element Method, to speed up the analysis of hollow circular cylinders, made from Functionally Graded Materials (FGMs). FGMs are inhomogeneous materials, which their composition gradually varies over volume. In parallel processing, an algorithm is first divided to independent tasks, which may use individual or shared da...
متن کاملAn Intelligent Computer Interface Utilizing Parallel Picocontrollers (TECHNICAL NOTE)
The design of an interface unit is described, in which RS232 serial data is converted to latched parallel data on 22 independent lines. The data direction of each line is programmable through the serial port. Two picocontrollers are employed in a parallel processing mode to give the required number of I/O pins, and data on the shared serial line is coded to separate data streams to the individu...
متن کاملParallel Spatial Pyramid Match Kernel Algorithm for Object Recognition using a Cluster of Computers
This paper parallelizes the spatial pyramid match kernel (SPK) implementation. SPK is one of the most usable kernel methods, along with support vector machine classifier, with high accuracy in object recognition. MATLAB parallel computing toolbox has been used to parallelize SPK. In this implementation, MATLAB Message Passing Interface (MPI) functions and features included in the toolbox help u...
متن کاملA New Approach to Solve N-Queen Problem with Parallel Genetic Algorithm
Over the past few decades great efforts were made to solve uncertain hybrid optimization problems. The n-Queen problem is one of such problems that many solutions have been proposed for. The traditional methods to solve this problem are exponential in terms of runtime and are not acceptable in terms of space and memory complexity. In this study, parallel genetic algorithms are proposed to solve...
متن کاملComprehensive Parametric Study for Design Improvement of a Low-Speed AFPMSG for Small Scale Wind-Turbines
In this paper, a comprehensive parametric analysis for an axial-flux permanent magnet synchronous generator (AFPMSG), designed to operate in a small-scale wind-power applications, is presented, and the condition for maximum efficiency, minimum weight and minimum cost is deduced. Then a Computer-Aided Design (CAD) procedure based on the results of parametric study is proposed. Matching between t...
متن کامل